DE-HTML.EXE is a simple MSDOS utility which reads as input an HTML
file, and writes as output a text file with all of the HTML tags
removed. Run with the optional commandline switch /url the output
will include all tags containing URLs ("href=http..."). Use the
switch /len=nn to set maximum output line length.
DE-HTML was written by request. It is available free of charge,
however the source code is copyright 1995-1996 by Bamboo Software.
History:
-------
Version 1.1b: 12/09/95 - Parse a few character entities for Paul Gorodyansky.
Version 1.2: 03/11/96 - Added new option /len=nn (maximum line
length for output), and wildcard filespec processing ("*.htm" for
example) after suggestions by Dan Goodman.
Version 1.3: 06/29/96 - Added minimal table support. Recognizes /th,
/td and /tr tags and new /del option lets user select field delimiter
character (defaults to semicolon). Suggested by Jonathan Rosenne.
Version 1.3a: 07/06/96 - Let user specify tab character as table delimiter
via "/del=tab" commandline arg; added
tag handling (duh).
Send feedback to the programmer, Frank Brown, at:
CIS: 74766,3671
browf@well.com
http://www.halcyon.com/frog/fhb.htm
Frank Brown, 29-Jun-1996